A Workload-Driven Approach for View Selection in Large Dimensional Datasets
نویسندگان
چکیده
منابع مشابه
Fast SFFS-Based Algorithm for Feature Selection in Biomedical Datasets
Biomedical datasets usually include a large number of features relative to the number of samples. However, some data dimensions may be less relevant or even irrelevant to the output class. Selection of an optimal subset of features is critical, not only to reduce the processing cost but also to improve the classification results. To this end, this paper presents a hybrid method of filter and wr...
متن کاملMaterialized View Selection for Multidimensional Datasets
To fulfill the requirement of fast interactive multidimensional data analysis, database systems precompute aggregate views on some subsets of dimensions and their corresponding hierarchies. However, the problem of what to precompute is difficult and intriguing. The leading existing algorithm, BPUS, has a running time that is polynomial in the number of views and is guaranteed to be within (0.63...
متن کاملA new approach for sustainable supplier selection
Recently, sustainable supply chain management (SSCM) has become one of the important subjects in the industry and academia. Supplier selection, as a strategic decision, plays a significant role in SSCM. Researchers use different multi-criteria decision making (MCDM) methods to evaluate and select sustainable suppliers. In the previous studies, evaluation is solely based on the desirable feature...
متن کاملA Clustering Scheme for Large High-Dimensional Document Datasets
Scalability and high dimensionality are two common problems associated with document clustering. We present a novel scheme to deal with these problems. Given a set of documents, we partition the set into several parts. We use one part and cluster the constituent documents into groups. By the obtained groups, we reduce the number of features by a certain ratio. Then we add another part, cluster ...
متن کاملA Method for Analyzing High-dimensional Datasets[A first approach]
Due to technological advances to massively collect data, the last decades have raised several challenges for data analysis. Data reduction is one of the most important. Typically such reduction can be made in two ways: decrease the number of instances (elements) of the dataset and decrease the number of required attributes (columns) to describe each of these instances. In the last case, methods...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Network and Systems Management
سال: 2020
ISSN: 1064-7570,1573-7705
DOI: 10.1007/s10922-020-09526-z